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TO ALL WHOM IT MAY CONCERN: 

Be it known that we, Chris B. Curtis 
a citizen of the United States of America, residing at 7339 E. McKinley St., 
Scottsdale 85257 and State of Arizona; and 
Joseph T. Marino, Jr. 

a citizen of the United States of America, residing at 1861 Idylwild Way, 
Prescott 86305 and State of Arizona; and 
Bruce A. Fette 

a citizen of the United States of America, residing at 2310 W. Dei Campo, 
Mesa 85202 and State of Arizona; 

have invented a new and useful METHOD AND APPARATUS FOR TESTING 
DIGITAL CHANNELS IN A WIRELESS COMMUNICATION SYSTEM, of which 
the following is a specification. 



METHOD AND APPARATUS FOR TESTING DIGITAL CHANNELS IN A 
WIRELESS COMMUNICATION SYSTEM 

Field of the Invention 

5 The present invention relates to wireless communication systems, and more 

particularly, to a method and an apparatus for audio testing digital channels in a wireless 
communication system. 

Background of the Invention 

10 Wireless communication systems that utilize digitally coded communication 

signals are known in the art. One such system is a direct sequence Code Division 
Multiple Access (DS-CDMA) cellular communication system, such as set forth in the 
Telecommunications Industry Association Interim Standard 2000 (TIA IS-2000) herein 
after referred to as IS-2000. In accordance with IS-2000, the coded communication 

1 5 signals used in the DS-CDMA system include signals that are transmitted in a common 
channel, typically a 1.25 MHz bandwidth common channel, between mobile stations 
(MS) and base transceiver stations (BTS) located at the base sites (BS) of a wireless 
communication system. 

A digital wireless communication system is a complex network of elements and 

20 their interconnections and protocols. Typically elements include (1) a radio link to the 
mobile stations (e.g., cellular telephones), which is usually provide by at least one and 
typically several base stations, (2) communication links between the base stations, (3) a 
controller, typically one or more base station controllers or centralized base station 
controllers (BSC/CBSC), to control communication between and to manage the operation 

25 and interaction of the base stations, (4) a call controller or switch, typically a call agent 
(i.e., a "softswitch"), for routing calls within the system, and (5) a link to the land line or 
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public switch telephone network (PSTN), which is usually also provided by the call 
agent. 

Optimization of performance efficiency of a wireless communication system is 
generally accomplished through fault detection testing. Of particular importance in a 
5 wireless communication system is the quality of a digital voice channel path. The typical 
digital voice channel path may include the over-the-air interface between a source mobile 
station and a destination mobile station as well as all of the infrastructure used to connect 
the source mobile station to the destination mobile station, for example the BTS, the 
transcoder in the CBSC, and the MSC. Various types of fault detection testing have been 

1 0 used in an attempt to detect problems in the digital voice channel path. The most well 
known testing method, commonly referred to as the analog system audio test, or SALT, 
was developed originally for use in analog communication systems. The SALT test 
utilizes special test equipment to inject a single-frequency audio tone into a previously 
established voice call. The single-frequency audio tone is looped back to the sending 

1 5 equipment where its frequency and audio level are measured. The test is considered to be 
successful (indicating a working channel ) if the frequency and audio level are reasonable 
close to the frequency and audio level of the injected tone. SALT however, has a number 
of problems when applied to a digital based wireless communication system. First, a 
single-frequency audio tone is not reproduced well by the digital infrastructure of a 

20 wireless communication system. As expected, vocoders in the call channel distort a 
single-frequency audio tone in completely unpredictable ways. In addition, the echo- 
cancelling and noise suppression devices in the call channel treat the single-frequency 
audio tone as noise or echo, and consequently attempts to filter or suppress it. Various 
combinations of tones have been tried in an attempt to overcome the aforementioned 

25 problems, however none have been found to produce better than about 95% accuracy with 
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respect to a comparison of the injected tone(s) with the frequency and audio level of the 
looped back tone. Second, there is a settling time associated with the use of single 
frequency audio tones such that approximately one minute is required to test each voice 
channel and there are generally hundreds of voice channels associated with each base 
5 station. In order to save time, additional test devices in the form of costly specially 
designed test equipment are needed. However, there is a physical limit to the number of 
test devices that can be connected to the cellular system at one time. 

A second test method, referred to as a Markov test is also used for fault detection 
on voice channels. It requires a special test mobile station to initiate a "markov call" 
10 which requires normal vocoding to be disabled and the normal call processing model to 
be significantly modified. As a result the Markov test only verifies a portion of the voice 
channel; the air interface between the network vocoders and the special test mobile 
station. 

A third test method referred to as Perceptual Speech Quality Measurement test, 
1 5 defined in the ITU P. 861 "Series P: Telephone Transmission Quality" specification, has 
been suggested as a alternative to SALT. The purpose of PSQM test is to measure the 
perceived quality of POTS telephony vocoders in an automated fashion, rather than 
having to rely on a statistical analysis of responses of human test subjects as is done to 
Mean Opinion Scores (MOS). In general, the PSQM test works by modeling the way 
20 which humans perceive the quality of speech. Test samples are injected into the PSQM 
model which generates a set of parameters from the test samples. The parameters are 
then forwarded to a mapping function where they are evaluated and assigned a Mean 
Opinion Score (MOS). The PSQM test however is not intended for use in a wireless 
communication environment. Further, even if it were applicable to wireless 
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communication environments, it would require complex and expensive hardware 
modifications to test equipment currently used for fault detection on voice channels. 

A fourth method used for fault detection on a voice channel utilizes a recorded 
reference speech sample (original recorded speech sample) which is injected into the call 
5 path of a first special test mobile station and received by a second special test mobile 
which records the resulting speech. In the Speech Waveform Matching test, the recorded 
resulting speech is compared to the original reference speech sample and the degree to 
which they match is measured. However, the process of vocoding does not necessarily 
reconstruct a replica that looks like the original recorded speech sample. Instead, 

10 vocoding reconstructs a replica that sounds like, to a human listener, the original recorded 
speech sample. Thus, the degree to which the original reference speech sample and the 
recorded resulting speech sample, matched, would always be skewed by the very 
presence of the vocoders in the voice channel path. In addition, the accuracy of the 
Speech Waveform Matching test is a function of the time it takes to perform the test. A 

15 longer recorded resulting speech sample yields a better correlation result than a shorter 
recorded resulting speech sample. However, the longer recorded resulting speech 
samples require more time to collect. Further, the special test mobile stations lack 
sufficient memory capacity to accommodate both the original reference speech sample 
and the longer recorded resulting speech. 

20 
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Brief Description of the Drawings 

FIG. 1 depicts a DS-CDMA wireless communication system. 
FIG. 2 is high-level block diagram of an embodiment of a digital channel test 
configuration. 

5 FIG. 3 is a more detailed block diagram showing one embodiment of a frame 

correlation error generator illustrated in FIG. 2. 

FIG. 4 is a pictorial representation of a first set of line spectrum pair frequencies 
described in connection with FIG. 3. 

FIG. 5 is a pictorial representation of a second set of line spectrum pair 
10 frequencies described in connection with FIG. 3. 

FIG. 6 is a more detailed block diagram showing another embodiment of a frame 
correlation error generator illustrated in FIG. 2. 

FIGs. 7 and 8, when joined along similarly lettered lines, together form a 
flowchart of a method for audio testing digital channels in a wireless communication 
15 system. 

FIG. 9 is a more detailed flowchart of a method for calculating a multiplicity of 
frame correlation metrics used to detect a fault in the digital channel illustrated in FIG. 2. 

Detailed Description of the Preferred Embodiments 

20 While the present invention is susceptible of embodiment in many different forms, 

there are shown in the drawings and will be described herein in detail specific 
embodiments thereof, with the understanding that the present disclosure is to be 
considered as an exemplification of the principles of the invention and is not intended to 
limit the invention to the specific embodiments illustrated. 
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As mentioned described above, a need remains for a method and apparatus to 
perform fault detection testing of a digital voice channel path in a digital wireless 
communication system environment. 

In accordance with an embodiment of the invention, a method and apparatus for 
determining a fault in a digital channel, operates in a digital wireless communication 
system. The digital wireless communication system may include a multiplicity of mobile 
stations, a multiplicity of base transceiver stations, each of the multiplicity of base 
transceiver stations in communication with at least one of the mobile stations, a 
transcoder configured to provide a signal conversion capability between vocoder frames 
and pulse code modulation, and a mobile switching center for interconnecting the digital 
wireless communication system to a public switched telephone network. A source test 
mobile station generates a first set of vocoder input parameters. The first set of vocoder 
input parameters results from a speech input signal which has been vocoder encoded. A 
transcoder generates a second set of vocoder input parameters. The second set of vocoder 
input parameters results from an output signal substantially equivalent to the speech input 
signal as it is received at a destination mobile station via the digital channel. A metric is 
calculated based on the first set of vocoder input parameters and the second set of 
vocoder input parameters, and a fault is determined in the digital channel using the 
metric. 

Turning now to the drawings, wherein like numerals designate like components, 
FIG. 1 depicts a CDMA wireless communication system 100. The CDMA wireless 
communication system 1 00 preferably comprises a direct spread code division multiple 
access (DS-CDMA) cellular communication system, although a multi-carrier CDMA 
cellular communication system may be suitable. 
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The CDMA wireless communication system 100 includes base stations 102, 104, 
106 providing service to coverage areas 122, 124, and 126 respectively, and one or more 
mobile stations although only one mobile station 130 is shown. The base stations 102 and 
106 include, among other things, a processor 140 and a memory 150. Similarly, base 
5 station 104 includes, among other things, a processor 142 and a memory 152. Base 
transceiver station 102 includes a transceiver (not shown) that transmits coded 
communication signals to, and receives coded communication signals from, the mobile 
station 130 via an antenna 108. A receiver, preferably a RAKE receiver within the 
transceiver provides tracking and demodulation capability of incoming multipath coded 

10 signals from mobile stations. Similarly, the mobile station 130 includes a transceiver that 
transmits coded signals to, and receives coded signals from, the base station 102 within a 
coverage area 122. In addition to a transceiver, the mobile station 130 includes an 
encoder speech codec which converts coded voice signals to electronic signals, typically 
via a code excited linear prediction (CELP) vocoder model, in order to compress the 

15 coded voice signal prior to transmission. The received electronic signals are then used by 
a decoding vocoder to reconstruct the coded voice signals at the receiving end. 

The base stations 102, 104, and 106 are coupled to a base station controller (BSC) 
162, which includes, among other things, a processor 144 and a memory 154, and which 
is in turn coupled to a mobile switching center (MSC) 160, also including a processor 146 

20 and a memory 156. The MSC 160 is coupled to a public switched telephone network 
(PSTN) 164 using known techniques. 

A mobile communication signal 132 is conveyed between the mobile station 130 
and the base station 102 through a radio frequency (RF) channel. The RF channel 
includes a reverse-link (mobile station 130 to base station 102) and a forward- link (base 

25 station 102 to mobile station 130). Multipath replicas 136 and 134, or copies of mobile 
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communication signal 132 resulting from of signal reflections off of nearby scatterers 
such as building 138, may also be conveyed between mobile station 130 and the base 
station 102. 

Signal transmission systems require a certain minimum frequency response in 
5 order to transmit a given signal. In order to achieve such a frequency response, the 

bandwidth of the signal transmission system must be at least twice the bandwidth of the 
signal being transmitted; a luxury not available in a wireless communication system. As 
a result of the bandwidth requirement in traditional signal transmission, a variety of audio 
coding methods were developed for use in wireless communication systems such as the 

10 CDMA wireless communication system 100. The central objective in audio coding, or 
compression, is to represent a signal with a minimum number of bits while generating an 
output audio signal that cannot be distinguished from the original input audio signal. This 
signal representation may be accomplished through the use of speech synthesis 
algorithms, for example a CELP vocoder algorithm. DS-CDMA utilizes numerous CELP 

1 5 vocoder algorithms to perform audio coding of variable rate speech. 

CELP vocoder algorithms used in wireless communication mobile stations operate 
by establishing an electronic model of the human vocal system. Using the CELP 
algorithm, human speech is converted to electronic signals by a process referred to as 
encoding. Transmission of the electronic signals require significantly less bandwidth than 

20 transmission of human voice. At the receiving end, the electronic signals are decoded 

and used to reconstruct the human speech. Thus, although the speech at the receiving end 
sounds like transmitted human voice, it is merely a reconstruction of human voice, 
utilizing the received electronic signals. 

An example of a CELP vocoder used for speech decoding in CDMA wireless 

25 communication system 100 is set forth in the Telecommunications Industry Association 
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Interim Standard 96-B (TIA IS 96-B) herein after referred to as IS-96B. In general, the 
CELP vocoder uses a codebook and a pseaudoramdom noise generator to vector-quantize 
the original voice signal using an analysis-by- synthesis method. To accomplish speech 
encoding , the CELP encoder samples the input speech, converts it to a digital format, and 

A A A A A A 

5 determines a set of input parameters /, G, L,b,al...alO , to be used in the CELP decoding 
procedure. The set of input parameters is selected to minimize the perceptual difference 
between the original speech being input to, and the synthesized speech being output from 
the CELP vocoder. These input parameters, determined from the speed of the voice 
speech, its pitch, its gain, etc., attempt to model the creation of voice by the vocal tract. 
1 0 Once determined, the input parameters are quantized and packed into data packets for 
transmission. To accomplish speech decoding, or reconstruction of the original speech 
at the other end, the data packets are unpacked, the received input parameters are 
unquantized, and the speech input signal is reconstructed as a synthesized speech output 
signal. Line spectrum pair (LSP) frequencies derived from linear predictive coding 

A A 

1 5 coefficients a 1. . . a 1 0 previously determined by the encoding procedure, are used to 
reproduce the speech signal. Accordingly, during the decoding process, the LSP 

A A 

frequencies are mathematically converted back to formant synthesis parameters al...alO . 
Thus, based on the received input parameters, a receiver in conjunction with a vocoder 
decoder, attempts to model the original input speech using the information contained in 
20 the input parameters to produce the synthesized speech output. 

As previously described, various fault detection techniques have been utilized in 
an attempt to optimize digital voice channel path quality. The current techniques, 
however, require lengthy test times, require complex and expensive hardware 
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modifications, do not take advantage of existing communication system properties, and 
are generally not well suited for application to digital voice channels. 

A high-level block diagram of an exemplary digital channel test configuration 
200 capable of employing the teachings of the present invention is illustrated in Fig. 2. 
5 Digital channel test configuration 200 utilizes the line spectrum pair (LSP) frequencies 

derived from the formant synthesis parameters a 1... a 10, to perform fault detection of a 
digital voice channel path, according to an embodiment of the present invention. A series 
of frame correlation error metrics are calculated from (1) the LSP frequencies generated 
by the encoding of a speech input signal, and (2) the LSP frequencies generated from the 

10 output signal, the output signal formed by decoding and re-encoding the speech input 
signal as it passed through a digital voice channel path. The series of frame correlation 
error metrics are used to determine whether a the digital voice channel path is acceptable. 

Digital channel test configuration 200, includes a source BTS 302, a destination 
BTS 304, a transcoder 306, a MSC 308, a PSTN 310, and a frame correlation error 

1 5 generator 340. Source BTS 302 further includes a source antenna 316 and a source test 
mobile station (TMS) 312 having source TMS software including vocoder encoder and 
decoder capability, while destination BTS 304 further includes a destination antenna 318 
and a destination TMS 314 having destination TMS software including vocoder encoding 
and decoding capability. Although a source TMS and destination TMS are depicted, any 

20 type of radio frequency diagnostic subsystem with mobile station capability may be used 
in digital channel test configuration 200. 

A speech input signal 319, transmitted over-the-air from source TMS 312 is 
received by source antenna 316. Speech input signal 319 is formed by vocoder encoding 
a speech signal received by a mobile station and is preferably composed of a generated, 

25 vocoder encoded data frame sample that can be decoded and then re-encoded while 



PATENT 
29259/CE08452R 

keeping the amount of distortion of the signal to a minimum. For example, the vocoder 
encoded data frame sample may be generated from a vowel sound (i.e. /eh/) with a pitch 
period that is an integer per frame (80 sample pitch for a 160 sample frame). The vocoder 
data frame sample may also be composed of forty frames; ten frames of the vowel sound 
5 followed by 10 frames of "silence" where the vocoder shifts to 1/8 frames speed, 

followed by 10 frames of the vowel sound, and followed by 10 more frames of silence. 

A A 

A set of LSP frequencies, a first set, calculated from input parameters al...al0, 
resulting from vocoder encoding the speech signal, is forwarded to frame correlation error 
generator 340. The set of LSP frequencies generated from 40 frames of encoded speech, 
10 includes many groups of LSP frequencies. Each group of LSP frequencies contains 10 
LSP frequencies. 

Next, speech input signal 319 is demodulated and receiver decoded by a receiver 
(not shown) at source BTS 302. A resulting signal 321, still vocoder encoded, is 
transmitted to transcoder 306 via a Tl span. Transcoder 306 uses the original set of input 

15 parameters I, G,L, b, a 1... a 10, including the 10 LSP frequencies per frame, to convert 

resulting signal 321 to a pulse code modulation (PCM) signal 323 suitable for receipt and 
transmission by MSC 308. Thus, transcoder 306 vocoder decodes speech input signal 
319. MSC 308, after having switched PCM signal 323 to an appropriate link, outputs a 
PCM signal 325 to transcoder 306. Transcoder 306 converts PCM signal 325 to a 

20 vocoder frame signal 327 suitable for receipt by destination BTS 304. Transcoder 306 

\ A A A A A 

generates a second set of input parameters I ,G,L,b,a\...a\0 , including 10 LSP 
frequencies per frame, which are theoretically identical to the original set of input 
parameters. Upon receipt, a receiver at BTS 304 modulates and receiver encodes 
vocoder frame signal 327 to produce an output signal 329, which is transmitted over-the- 
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air via antenna 318 to destination TMS 314. Output signal 329 results from decoding and 
re-encoding speech input signal 319 as it travels from antenna 316 to destination TMS 
314 through digital channel test configuration 300. Upon receipt of output signal 329 by 
destination TMS 314, a set of LSP frequencies, a second set, calculated from input 

5 parameters al...a\0, describing output signal 329, is forwarded to frame correlation error 
generator 340. Again, the set of LSP frequencies generated from 40 frames of encoded 
speech, includes many groups of LSP frequencies. Each group of LSP frequencies 
contains 10 LSP frequencies. The first set of LSP frequencies from input speech signal 
319, and the second set of LSP frequencies from output signal 329, are mathematically 

10 compared using methods discussed in connection with Fig. 3 to determine if the digital 
voice channel path is acceptable. 

A more detailed block diagram of an embodiment of frame correlation error 
generator 340 is illustrated in Fig. 3. In this embodiment, frame correlation error 
generator 340 includes a plurality of modules 502, 504, 506, 508, and 521. Each of these 

1 5 modules may be implemented by a microprocessor executing software instructions and/or 
conventional electronic circuitry. In addition, a person of ordinary skill in the art will 
readily appreciate that certain modules may be combined or divided according to 
customary design constraints. 

For purposes of determining whether a digital voice channel path is acceptable, a 

20 series of frame correlation error metrics are calculated and compared by frame correlation 
error generator 340. As previously discussed in connection with Fig. 2, a frame 
correlation error metric is calculated using the one group's worth of 10 LSP frequencies 
resulting from speech input signal 319, and one group's worth of 10 LSP frequencies 
resulting from output signal 329. An LSP frequency extractor module 502 extracts a first 

25 set of LSP frequencies 501 from speech input signal 319, the first set of LSP frequencies 
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composed of groups of LSP frequencies, each group composed of 10 LSP frequencies. 
Fig. 4 is a pictorial representation of the first set of LSP frequencies described in 
connection with Fig. 3. As shown in Fig.4, the first set of LSP frequencies 501 is 
represented as a series of groups of ten LSP frequencies, each group often LSP 
5 frequencies derived from a corresponding encoded vocoder frame. Thus, the first group 
530 of the first set of LSP frequencies 501 includes 10 LSP frequencies corresponding 
with frame one of the speech input signal, the second group 532 of the first set of LSP 
frequencies 501 includes 10 LSP frequencies corresponding to frame two of the speech 
input signal, the third group of ten LSP frequencies includes 10 LSP frequencies 
1 0 corresponding to frame three of the speech input speech signal, and the m th group 534 of 
the first set often LSP frequencies 501 includes 10 LSP frequencies corresponding to 
frame m of the speech input speech signal. Although the first set of LSP frequencies, as 
described, is calculated by formant synthesis filter 210 from linear predictive coding 

A A 

coefficients al...al0, derived from the actual input speech signal received by source 
1 5 TMS 3 1 2, the first set of LSP frequencies may also be calculated from linear predictive 

coding coefficients a\...a\0, selected to model an input speech signal. 

LSP frequency extractor module 502 also extracts a second set of LSP frequencies 
503 from output signal 329 . The second set of LSP frequencies is extracted from output 
signal 329 prior to decoding output signal 329 by destination TMS 314. Fig. 5 is a 
20 pictorial representation of second set of LSP frequencies 503 described in connection 

with Fig. 3. Second set of LSP frequencies 503 is represented as a series of groups often 
LSP frequencies, each group of ten LSP frequencies derived from a corresponding 
vocoder frame. Thus the first group 540 of the second set of LSP frequencies 503 
includes 10 LSP frequencies corresponding with frame one of the output signal, the 
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second group 542 of the second set of LSP frequencies 503 includes 10 LSP frequencies 
corresponding to frame two of the output signal, the third group of the second set of ten 
LSP frequencies includes 10 LSP frequencies corresponding to frame three of the output 
speech signal, and the nth group 544 of the second set often LSP frequencies 503 
5 includes 10 LSP frequencies corresponding to frame n of the output speech signal. 

Next, a frame aligner module 504 aligns first group 530 of the first set of LSP 
frequencies with first group 540 of the second set of LSP frequencies. The first group, or 
frame, of the first set of LSP frequencies 530 includes 10 LSP frequencies 
LSPj ! (1), LSP X , (2), LSP l [ (3). .. LSP l , (1 0) . Similarly, the first group, or frame, of the 

1 0 output signal includes 1 0 LSP frequencies, LSP n (1), LSP 2] (2), LSP 2i (3). . . LSP 21 (10). 

A frame correlation calculator 506 then calculates a first-frame correlation error 
507 using the 10 LSP frequencies from first group 530 of the first set of LSP frequencies 
and the 10 LSP frequencies from first group 540 of the second set of LSP frequencies as 
follows. First, the difference between LSP 2l (Y) and LSP n (l) is calculated to form a 

1 5 difference value. The difference value is squared and multiplied by a weight factor to 
yield a first weighted LSP value. The weight factor selected corresponds to the LSP 
frequencies used to calculate the difference. The weight factors are preselected and 
represent the relative subjective importance of each of the 10 LSP frequencies to a 
listener. The weight factors may be selected using a variety of methods including 

20 performing experimental runs of weighted voice samples and noting a listener's audio 
perception of the voice sample. 

For example, Table 1 shows a set of pre-selected weight factors corresponding to 
LSP frequencies 1-10, generated by experimentation, and which may be used to calculate 
a frame correlation error. 
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W(l) 


0.125 


W(6) 


0.094 


W(2) 


0.156 


W(7) 


0.094 


W(3) 


0.125 


W(8) 


0.063 


W(4) 


0.125 


W(9) 


0.063 


W(5) 


0.094 


W(10) 


0.063 



Table 1 

Next, the difference between LSP 21 (2) and LSP U (2) is calculated. As before, the 
difference is squared and multiplied by the weight factor to yield a second weighted LSP 
5 value. Again, the weight factor is selected based on the LSP frequencies used to calculate 
the difference. The process of calculating the difference, squaring, and then multiplying 
by a weighting factor continues eight more times until the difference between LSP 2l (\0) 
and LSP U (10) is calculated, squared, and multiplied by a weighting factor, thereby 
yielding 10 weighted LSP values denoted as a first weighted LSP value, a second 
10 weighted LSP value, ... a tenth weighted LSP value. Second, the 10 weighted LSP values 
are summed together, forming a sum LSP value. Next, the sum LSP value is divided by 
10 to form a mean LSP value. Finally, the square root of the mean LSP value is taken, 
thereby yielding first-frame correlation error metric 507. Frame correlation metric 507, a 
weighted root-mean-square correlation error average, represents a spectral distortion 
1 5 measurement of the input signal as it travels through the digital channel. 

The frame correlation error metric calculation may be represented as follows: 

fcw(iiLSP M (i)- LSP m (i)] 2 
\ — = Frame Correlation Error Metric 
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wherein, LSP oul (i) is a group of 10 LSP frequencies from the second set of LSP 
frequencies , each of the 10 LSP frequencies having a value denoted as (0 and LSP m (i) is 
a group of 10 LSP frequencies from the first set of LSP frequencies, each of the 10 LSP 
frequencies having a value denoted as (z) , and W(i) is a weight factor value, each of the 
5 weight factor values pre-selected for each of the 10 LSP frequencies. 

In order to achieve an accurate frame correlation error, the frames of the input 
speech signal should substantially align with their corresponding frames of the output 
speech signal. Because, however, any digital wireless communication channel such as 
digital channel test configuration 300 may drop frames and/or introduce time delay, 

1 0 spectral distortion may occur and the first frame of the input speech sample may not align 
with the first frame of the output speech signal. To accommodate the possibility of 
dropped frames and/or introduced delays, repeated calculations of frame correlation error 
metric 507 are calculated using the algorithm described above except the output signal is 
realigned with the input signal, frame by frame, such that first group of the second set of 

15 LSP frequencies 540 is aligned with second group of the first set of LSP frequencies 532, 
and so on. Thus, for calculation of a second- frame correlation metric, the first group of 
the second set of 10 LSP frequencies corresponding with frame one of the output signal 
second { LSP 21 (\),LSP 21 (2),LSP 2l (3)...LSP 2] (10) } is compared to the second group of the 
first set of 1 0 LSP frequencies corresponding to frame two of the input signal 

20 { LSP U (1), LSP U (2), LSP n (3). . . LSP U (1 0) } rather than first group of the first set of 1 0 

LSP frequencies corresponding to frame one of the input signal. In other words, the first 
frame of the output speech signal is aligned with the second frame of the input speech 
signal and the second-frame correlation metric is calculated. The process of comparing 
the first frame of the output speech signal to each frame consecutive frames of the input 
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speech signal continues until the midpoint frame of the speech input signal is reached, 
each comparison yielding a frame correlation error metric 507. The frame correlation 
metric from among first-frame correlation metric, second-frame correlation metric, third- 
frame correlation metric, ... m/2 -frame correlation metric, having the smallest value, is 
5 then selected by a frame correlation selector module 508. A resulting minimum frame 
correlation metric 5 1 1 corresponds the optimum alignment of frames between the speech 
input signal 319 and the output signal 329. 

To determine whether the digital voice channel path is acceptable, minimum 
frame correlation metric 5 1 1 is compared to an expected threshold value in a threshold 

10 indicator 512. The expected threshold value is a preselected value based on the 

configuration of digital channel test configuration 300. If minimum frame correlation 
metric 51 1 is larger than the expected threshold value, output speech signal 329 contains 
an unacceptable amount of distortion, indicating that the digital voice channel path is not 
acceptable. However, if minimum frame correlation metric 51 1 is less than, or equal to 

15 the expected threshold value, output speech signal 329 contains an acceptable amount of 
distortion, indicating that the digital voice channel path is acceptable. 

In another embodiment, each frame correlation error metric is compared to the 
previously calculated frame correlation error metric. Between the previous and current 
frame correlation error metric, the frame correlation metric having the smallest value is 

20 forwarded to threshold indicator 521. For example, the first-frame correlation metric is 
compared to the second- frame correlation metric, the second-frame correlation metric 
having a larger value than the first-frame correlation error metric. The second-frame 
correlation metric is then discarded, and the remaining first-frame correlation metric is 
compared to the next, or third-frame correlation error metric. Such an iterative approach 

25 to selecting the smallest value from two values precludes the need for gathering and 
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storing the first- frame correlation metric, the second-frame correlation metric, the third- 
frame correlation metric, ... m/2-frame correlation metric, prior to selecting the frame 
correlation metric having the smallest value. 

A more detailed block diagram of another embodiment of frame correlation error 
5 generator 340 is illustrated in Fig. 6. Frame correlation error generator 340 may be a 
personal computer (PC), a personal digital assistant (PDA), an Internet appliance, a 
mobile station including either the source TSU or the destination TSU, or any other 
communication device. Frame correlation error generator 340 includes a controller 342 
which preferably includes a central processing unit (CPU) 344 electrically coupled by 

10 address/data bus 346 to a memory device 348 and an interface circuit 350. CPU 344 may 
be any type of well known CPU, such as an Intel PentiumTM processor, or a Motorola 
PowerPC. Memory device 348 preferably includes volatile and non-volatile memory. 
Preferably, memory device 348 stores a software algorithm that interacts with digital 
channel test configuration 200 as described below. Memory device 348 may also store 

15 digital data indicative of inputs from digital channel test configuration 300, for example 
LSP frequencies and codes, as well as documents, files, programs, web pages, etc. 
retrieved from a server (not shown). In addition, memory device 348 may also store 
digital data loaded via an input device 352. 

Interface circuit 350 may be implemented using any type of well known interface 

20 standard such as an Ethernet interface and/or a Universal Serial Bus (USB) interface. 

One or more input devices 352 may be connected to interface circuit 350 for entering data 
and commands by an operator into controller 342. For example, input device 352, 
preferably located at an operations and maintenance center for radio sub-system (OMC- 
R), may be a keyboard, mouse, touch screen, track pad, track ball, isopoint, and/or a voice 

25 recognition system. 
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One or more displays, printers, speakers, and/or other output devices 354 may also 
be connected to controller 342 via interface devices 350. Display 354 may be a cathode 
ray tube (CRTs), liquid crystal displays (LCDs), or any other type of display. Display 
354 may generate visual displays of data generated during operation of frame correlation 
5 error generator 340, and/or may include prompts for human operator input, calculated 
values, detected data, etc. 

Frame correlation error generator 340 may also exchange data with other devices 
via any type of network connection 331 to digital channel test configuration 200. 
Network connection 331 may be any type of network connection such as one or more 
: 1 0 twisted pairs of copper wire, an Ethernet connection, a digital subscriber line (DSL, a 
coaxial cable, a fiber optic cable, etc. 

In addition to providing a comparison value to be compared with an expected 
threshold value, the minimum frame correlation error can provide a "fingerprint" of a 
faulty element or link of the digital channel test configuration 200. This is accomplished 
15 by comparing the correlation error to specific ranges. If the comparison falls within a 
specific range window, it implies that a specific subsystem of effect is responsible for an 
observed fault. 

Referring to FIG. 7, a method 400 of audio testing digital channels in a wireless 
communication system begins at step 402 where a first set of vocoder input parameters is 

20 generated from a speech input signal 319. The speech input signal 319 results from 
vocoder encoding a speech signal received at a source mobile station, for example the 
source TMS 312 shown in FIG. 2 . At step 404, a second set of vocoder input parameters 
is generated from an output signal 329. The output signal 329 results from the speech 
input signal 319 being transmitted through the digital channel test configuration 200. 

25 Thus, the output signal 329 results from decoding and re-encoding the speech input signal 
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319 as it travels from the source TMS 312, through the source BTS 302, through the 
transcoder 306, through the MSC 308, through the transcoder 306, through the destination 
BTS 304, through the destination antenna 318, and to the destination TMS 314 shown in 
FIG. 2. Next, a first plurality of LSP frequencies 501 is extracted from the speech input 
5 signal 3 19 at step 406, and a second plurality of LSP frequencies 503 is extracted from 
the output signal 329 at step 408. The first plurality of LSP frequencies 501 includes a 
series of m groups often LSP frequencies. Likewise, the second plurality of LSP 
frequencies 503 includes a series of n groups often LSP frequencies. At step 410, a 
multiplicity of frame correlation error metrics is calculated using the first and second 

1 0 plurality of LSP frequencies. 

A more detailed flowchart of the method for calculating the multiplicity of frame 
correlation metrics, step 410, is shown in FIG. 9. Calculating the multiplicity of frame 
correlation errors begins at step 430 where the m group is initialized to the 1st group of 
the first set of LSP frequencies 501 . The m group of the 1st set of LSP frequencies 501 is 

15 then aligned with the 1st group of the second set of LSP frequencies 503 at step 432. At 
step 434, a difference is calculated between each of the 10 LSP frequencies from the m 
group of the 1st set of LSP frequencies 501 and each of the corresponding 10 LSP 
frequencies of the 1st group of the second set of LSP frequencies 503 to form 10 LSP 
difference values. Each of the 10 LSP difference values are squared at step 436, forming 

20 10 squared LSP values. Each of the 10 squared LSP values is multiplied by a 

corresponding weight factor, step 438, forming 10 weighted LSP values. The 10 LSP 
weighted values are summed together at step 440, yielding a sum LSP value. The sum 
LSP value is divided by 10, step 442, yielding a mean LSP value. At step 444, the square 
root of the mean LSP value is calculated, forming one m correlation metric. The next 

25 group of the 1st set of LSP frequencies 501, or the m+1 group, is aligned with the 1st 
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group of the second set of LSP frequencies 503 at step 446. The process of calculating 
another frame correlation error metric is repeated using steps 434-446, thereby yielding 
one frame correlation metric per alignment until the midpoint group of the 1st set of LSP 
frequencies 501 is reached at step 448. Accordingly, a multiplicity of frame correlation 

5 metrics is calculated. 

Referring back to FIG. 7, at step 412, the frame correlation selector module selects 
the frame correlation error metric having the smallest value from the multiplicity of frame 
correlation error metrics, forming a minimum FCE. The minimum FCE is compared to a 
predetermined threshold value at step 414. If the minimum FCE is equal to, or less than a 

10 predetermined threshold value, no fault is detected in the digital channel 200 at step 418. 
If the minimum FCE is greater than the predetermined threshold value, however, a fault 
detection in the digital channel 200 is declared at step 416. 

Referring to FIG. 8, a method for detecting a faulty element of the digital channel 
200 begins at step 420 where a plurality of fingerprint fault values are generated for each 

15 faulty element of the digital channel 200. Preferably, each of the plurality fingerprint 

fault values was previously determined under controlled test conditions. At step 422, the 
minimum FCE value generated by the frame correlation error generator 340 is compared 
to the plurality of fingerprint fault values. The faulty element of the digital channel is 
then determined based on the comparison at step 424. 

20 For example, one type of fault may result from (1 ) the selection of a dead channel 

element in the source BTS 302, causing a "silent" output signal at the destination TMS 
314. The fingerprint fault value expected for a fault of this type may be equivalent to a 
minimum FCE value up to about 50% higher than the predetermined threshold value 
because the output signal would be about 100% silence rather than the output signal 329 

25 expected to be composed of about 50% silence and 50% sound. Another type of fault 
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may result from (2) the selection of a noisy terrestrial circuit by the MSC 160 causing 
additional noise in the output signal. The fingerprint fault value expected for a fault of 
this type may be equivalent to a slightly higher minimum FCE value than (1) and 
therefore may be equivalent to a minimum FCE value approximately 50% higher or 

5 above than the predetermined threshold value a minimum FCE value because the noise 
would be spread through out the output signal. Yet another type fault may result from (3) 
the selection of a weak radio channel between the source BTS 302 and the source antenna 
316 causing a large number of dropped frames in the output signal. The fingerprint fault 
value expected for a fault of this type may be equivalent to a minimum FCE value greater 

1 0 than predetermined threshold value but less then the fingerprint fault value for (1). Thus, 
the experimentally predetermined fingerprint fault values can be compared to the 
minimum FCE value generated by the frame correlation generator 340 to determine the 
faulty element within the digital channel 200. 
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